Zijie Zhou

Hyperbolic and Evidence-Prioritized Experts for Large Vision-Language Models

May 29, 2026
Via arXiv

AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems

May 20, 2026
Via arXiv

A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints

May 06, 2026
Via arXiv

PolarMem: A Training-Free Polarized Latent Graph Memory for Verifiable Multimodal Agents

Jan 31, 2026
Via arXiv

Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving

Jan 29, 2026
Via arXiv

A Universal Load Balancing Principle and Its Application to Large Language Model Serving

Jan 25, 2026
Via arXiv

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Jan 20, 2026
Via arXiv

Adaptively Robust LLM Inference Optimization under Prediction Uncertainty

Aug 20, 2025
Via arXiv

LLM Serving Optimization with Variable Prefill and Decode Lengths

Aug 08, 2025
Via arXiv

LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition

Apr 27, 2025
Via arXiv